Generate robust weights for points excluded by UV-cut. #225

JSKenyon · 2023-01-13T10:18:21Z

@landmanbester Would appreciate your thoughts here. Current behaviour results in weights of zero for points excluded by the UV-cut. This has repercussions for MAD flagging as points with zero weight will end up flagged (whitened residuals will be zero). These changes will result in robust weights being produced for points excluded by the UV-cut. My intuition is that this is sensible - we are applying the gains to those points after all.
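A minimal sketch of the zero-weight issue described above, assuming whitening multiplies each residual by the square root of its weight. The array names and toy data are hypothetical and are not taken from the codebase.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: 8 complex residuals, the first 3 lie inside the UV-cut.
residuals = rng.normal(size=8) + 1j * rng.normal(size=8)
in_uv_cut = np.array([True, True, True, False, False, False, False, False])

# Behaviour described above: points excluded by the cut carry zero weight.
weights = np.where(in_uv_cut, 0.0, 1.0)

# Assumed whitening: scale each residual by the square root of its weight.
whitened = np.sqrt(weights) * residuals

# Excluded points are identically zero, whatever the data looked like.
print(np.abs(whitened[in_uv_cut]))
```

With non-zero (robust) weights on the excluded points, their whitened residuals carry information about the data again.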

landmanbester · 2023-01-13T10:50:34Z

I think this is about the most sensible thing you can do. It also shows why flagging on the whitened residuals is the way to go. If you flag on just the residuals when there is unmodelled flux present, you run the risk of i) biasing your MAD estimates and ii) flagging unmodelled flux. You may also want to have an option where points within the baseline cut are not considered during MAD flagging (I am thinking of the case where you smoove (not a typo) the gains and flag on the whitened residuals using the original weights).

JSKenyon · 2023-01-13T11:26:33Z

Excluding points in the UV-cut during MAD flagging was the cause of some of @o-smirnov's problems. If a baseline is excluded, it will not have a MAD estimate, at which point it becomes a little unclear what to do with it, i.e. you either have to flag it entirely (this is incorrect and is what was previously happening) or ignore it, which means that the flagging on the short baselines may be bad. For now I think that ignoring the UV-cut in both the reweighting and the MAD flagging is sensible. After all, even if there is bad data there, it will not affect the gains, and ultimately we probably do want to flag/downweight bad data. All of this is up for debate though - we can modify as needed.

o-smirnov · 2023-01-13T14:11:21Z

> For now I think that ignoring the UV-cut in both the reweighting and the MAD flagging is sensible.

Hmm I think this is slightly different from the CC madmax behaviour, which has a separate residual flagging round at the end (enabled by --madmax-residuals) that flags all residuals, including those excluded by the cut (since these are computed anyway, as they should be).

But now that I think about it, there are use cases for both behaviours:

* I routinely use a mild uv-cut (100m or so) to avoid contamination in the gains from leftover RFI on short baselines. However, in this case I still want to do MAD flagging on those short baselines.
* If the image was constructed with a uv-cut/inner taper (as we now try to do for Sun-contaminated images), the model on short spacings is invalid. But now I _don't_ want to do MAD flagging on them, because it would presumably flag an excessive amount of data. @Victoria-Samboco's solar imaging pipeline, for example, should operate in this mode, because she's going to rephase and image the Sun later.
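The two use cases above could be covered by a single switch. The following is only a sketch: `mad_flags` and its `flag_inside_cut` parameter are hypothetical names, not existing options, and the flagger works on plain whitened residual amplitudes rather than on any per-baseline grouping.

```python
import numpy as np

def mad_flags(whitened, in_uv_cut, threshold=5.0, flag_inside_cut=True):
    """Hypothetical MAD flagger acting on whitened residual amplitudes."""
    amp = np.abs(whitened)
    # Points that contribute to the statistics and are allowed to be flagged.
    if flag_inside_cut:
        eligible = np.ones(amp.shape, dtype=bool)
    else:
        eligible = ~in_uv_cut
    med = np.median(amp[eligible])
    mad = np.median(np.abs(amp[eligible] - med))
    # 1.4826 scales the MAD to a Gaussian standard deviation.
    sigma = 1.4826 * mad
    flags = np.zeros(amp.shape, dtype=bool)
    flags[eligible] = np.abs(amp[eligible] - med) > threshold * sigma
    return flags

# Toy amplitudes: the last two points are inside the cut, one of them is bad.
whitened = np.array([1.0, 1.1, 0.9, 1.05, 0.95, 1.0, 50.0, 1.02])
in_uv_cut = np.array([False] * 6 + [True] * 2)

flags_all = mad_flags(whitened, in_uv_cut, flag_inside_cut=True)
flags_skip = mad_flags(whitened, in_uv_cut, flag_inside_cut=False)
print(flags_all)
print(flags_skip)
```

With `flag_inside_cut=True` the outlier inside the cut is flagged (first use case); with `flag_inside_cut=False` points inside the cut are left alone (second use case). Note that this sketch does not handle the case where every point is inside the cut, in which no MAD estimate exists.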

JSKenyon · 2023-01-16T10:32:07Z

> * If the image was constructed with a uv-cut/inner taper (as we now try to do for Sun-contaminated images), the model on short spacings is invalid. But now I _don't_ want to do MAD flagging on them, because it would presumably flag an excessive amount of data. @Victoria-Samboco's solar imaging pipeline, for example, should operate in this mode, because she's going to rephase and image the Sun later.

Interesting. I could add options for finer control over this behaviour. However, my instinct is that this is a dangerous regime in which to use the MAD flagger anyway. Fundamentally, the MAD flagger assumes that the statistics of the whitened residuals (although this may need to be the whitened corrected residuals, now that I think about it) are Gaussian. This assumption is only truly valid once our model is complete and our data is adequately calibrated. Prior to that, the residuals are actually Student-t distributed with an unknown degrees-of-freedom (DOF) parameter. This means that we need to be very careful when MAD flagging (particularly if we have no weights or our weights are incorrect), or else we can introduce weird biases which can manifest as ghosts. It should still catch gross outliers, but using it at the outset will almost certainly cause problems.
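To illustrate the Student-t point, here is a sketch of the textbook EM weight for a univariate Student-t likelihood with unit scale, `w = (dof + 1) / (dof + r**2)`. This generic scalar form is an assumption made for illustration; the exact expression used for complex, multi-correlation visibilities is not given in this thread, and `student_t_weights` is a hypothetical name.

```python
import numpy as np

def student_t_weights(whitened, dof):
    """Textbook EM weights for a unit-scale univariate Student-t (assumed form)."""
    return (dof + 1.0) / (dof + np.abs(whitened) ** 2)

# Whitened residual amplitudes ranging from well-behaved to a gross outlier.
whitened = np.array([0.0, 1.0, 3.0, 10.0])

# Small DOF: heavy tails, so outliers are strongly downweighted.
print(student_t_weights(whitened, dof=2.0))

# Very large DOF: the distribution approaches a Gaussian and weights approach 1.
print(student_t_weights(whitened, dof=1e9))
```

The weights fall off smoothly with residual amplitude when the DOF is small, whereas a hard MAD threshold implicitly treats everything below the cut as Gaussian.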

Generate robust weights for points excluded by UV-cut.

c17c9f8

JSKenyon mentioned this pull request Jul 6, 2023

Investigate interaction of robust reweighting, uv-cut and writing of corrected weight. #283

Open
